Picture for Lei Jiang

Lei Jiang

MOSAIC: Modular Orchestration for Structured Agentic Intelligence and Composition

Add code
May 30, 2026
Viaarxiv icon

Knowledge Dependency Estimation for Reliable Question Answering

Add code
May 27, 2026
Viaarxiv icon

LLMSpace: Carbon Footprint Modeling for Large Language Model Inference on LEO Satellites

Add code
May 07, 2026
Viaarxiv icon

Negative Advantage Is a Double-Edged Sword: Calibrating Advantage in GRPO for Deep Search

Add code
Apr 20, 2026
Viaarxiv icon

Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning

Add code
Apr 15, 2026
Viaarxiv icon

Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models

Add code
Apr 11, 2026
Viaarxiv icon

Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty

Add code
Apr 11, 2026
Viaarxiv icon

Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity

Add code
Mar 18, 2026
Viaarxiv icon

TGM-VLA: Task-Guided Mixup for Sampling-Efficient and Robust Robotic Manipulation

Add code
Feb 28, 2026
Viaarxiv icon

Dialogue Model Optimization via Agent Game and Adaptive Tree-based GRPO

Add code
Feb 09, 2026
Viaarxiv icon